Emergence of a small world from local interactions: 
Modeling acquaintance networks 
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How does one make acquaintances? A simple observation from everyday experience is that often one 
of our acquaintances introduces us to one of his acquaintances. Such a simple triangle interaction 
may be viewed as the basis of the evolution of many social networks. Ifere, it is demonstrated that 
this assumption is sufficient to reproduce major non-trivial features of social networks: Short path 
length, high clustering, and scale-free or exponential link distributions. 
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A remarkable feature of many complex systems is the 
occurrence of large and stable network structures, as, for 
example, networks on the protein or gene level, ecologi- 
cal webs, communication networks, and social networks 
[Q-^. Already simple models based on disordered net- 
works arc quite successful in describing basic properties 
of such systems. When addressing topological proper- 
ties, however, neither random networks nor regular lat- 
tices provide an adequate framework to model charac- 
teristic features. A helpful concept along this line is the 
idea of "small-world networks" introduced by Watts and 
Strogatz 1^,^, which initiated an avalanche of scientific 
activity in this field ||6|-[rT[|. Small- world networks inter- 
polate between the two limiting cases of regular lattices 
with high local clustering and random graphs with short 
distances between nodes. High clustering means that, 
if node A is linked to node B, and B is linked to node 
C, there is an increased probability that A will also be 
linked to C. Another useful measure is the distance be- 
tween two nodes, defined as the number of edges along 
the shortest path connecting them. A network is called 
a "small-world network" if it exhibits the following two 
characteristic properties [p[p^: (i) high clustering; and 
(ii) a small average shortest path between two random 
nodes (the diameter of the network), scaling logarithmi- 
cally with the number of nodes. Thus, any two nodes in 
the network are connected through only a small number 
of links. The most popular manifestation of a small world 
is known as "six degrees of separation" , a postulate by 
the social psychologist Stanley Milgram stating that 
most pairs of people in the United States can be con- 
nected through a path of only about six acquaintances 

Let us here focus on social networks, and acquaintance 
networks in particular, which are typical examples for 
the small- world property [p],p|jT^] . First of all, what does 
the concept of "small-world networks" tell us about real 
world systems? In its original definition it served as 
an elegant toy model, demonstrating the consequences of 
high clustering and short path length. However, as these 
networks are derived from regular graphs, their applica- 
bility to real world systems is very limited. In particular. 



how a network in a natural system forms a small-world 
topology dynamically, often starting from a completely 
random structure, remains unexplained. The main goal 
of this paper is to provide one possible answer to this 
problem. 

A similar problem of dynamical origin is faced (and 
much progress has been made) in a different, but not 
completely unrelated field: The dynamics of scale-free 
networks. Scale-free properties are commonly studied in 
diverse contexts from, e.g., the stability of the internet 
|l5| to the spreading of epidemics , and are observed 
in some social networks The origin of scale- 

free properties is well understood in terms of interactions 
that generate this topology dynamically, e.g., on the basis 
of network growth and preferential linking . While 

these models generate scale-free structures, they do not, 
in general, lead to clustering and are therefore of limited 
use when modeling social networks. 

In this paper, an attempt is made to unify ideas from 
the two worlds of "small-world networks" and "scale- 
free networks" which may help understanding social net- 
works, and how a small- world structure can emerge dy- 
namically. In particular, a simple dynamical model for 
the evolution of acquaintance networks is studied. It gen- 
erates highly clustered networks with small average path 
lengths which scale logarithmically with network size. 
Furthermore, for small death-and-birth rates of nodes 
this model converges towards scale-free degree distribu- 
tions, in addition to its small-world behavior. Basic in- 
gredients are a local connection rule based on "transitive 
linking", and a finite age of nodes. 

To be specific, let us formulate a model of an acquain- 
tance network with a fixed number N of nodes (as per- 
sons) and undirected links between those pairs of nodes 
that represent people who know each other. Acquain- 
tance networks evolve, with new acquaintances forming 
between individuals, and old ones dying. Let us assume 
that people are usually introduced to each other by a 
common acquaintance and that the network is formed 
only by people who are still alive. The dynamics is de- 
fined as follows: 

(i) One randomly chosen person picks two random ac- 
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quaintances of his, and introduces them to each another. 
If they have not met before, a new Unk between them is 
formed. In case the person chosen has less than two ac- 
quaintances, he introduces himself to one other random 
person. 

(ii) With probabihty p, one randomly chosen person is 
removed from the network, including all links connected 
to this node, and replaced by a new person with one 
randomly chosen acquaintance. 

These steps are then iterated. Note that the number 
of nodes remains constant, neglecting fluctuations in the 
number of individuals in acquaintance networks. The fi- 
nite age implies that the network reaches a stationary 
state which is an approximation of the behavior of many 
social networks, and is in contrast to most models based 
on network growth |p|, p^j20| . The probability p deter- 
mines the separation of the two timescales in the model. 
In general, the rate at which people make social contacts 
can be as short as minutes or hours, while the timescale 
on which people join or leave the network may be as long 
as years or decades. In the following, we therefore focus 
on the regime p ^ 1. 

Once the network reaches a statistically stationary 
state, one of its characteristic quantities is the degree 
distribution P{k) of the network. In Fig. |l|, the degree 
distribution is shown for different values of p. 




FIG. 1. The degree distribution P{k) of the transitive 
linking model in the statistically stationary state. The dis- 
tribution exhibits a power-law regime for small p, with an 
exponent of 1.35 for p — 0.0025. Note that the distribution is 
largely insensitive to system size A'', which here is A'' = 7000. 
The exponential cutoflt is a result of the finite age of nodes. 

Due to the limited lifetime of persons in the network, 
the observed numbers of acquaintances of different per- 
sons do not grow forever, but rather fall into some finite 
range. This can be seen in the degree distribution where 
the cut-off at high k results from the finite age of nodes. 
In the regime p <C 1, the degree distribution is dominated 



by transitive linking (i), resulting in a power-law range 
which increases in size with decreasing p. For larger val- 
ues of p, the Poissonian death process (ii) competes with 
the transitive linking process (i) , resulting in a stretched 
exponential range in the degree distribution until, in the 
case p « 1, the random linking of (ii) dominates with its 
Poissonian dynamics. Therefore, the above model gen- 
erates degree distributions spanning scale-free and expo- 
nential regimes that are also observed in the statistics of 
social networks. For large enough network size N, the 
specific distribution P{k) only depends on the single free 
parameter of the model p. From experimental data one 
can estimate p as well, which is typically very small p <^ 1 
such that the two timescales of the network dynamics are 
well separated. 

As already noted above, "small-world networks" are 
characterized by a high degree of clustering C and a small 
average path length £ which scales logarithmically with 
the number of nodes. The degree of clustering is mea- 
sured by the clustering coefficient defined as follows: For 
a distinct node i, the clustering coefficient Ci is given 
by the ratio of existing links Ei between its ki neighbors 
to the possible number of such connections ■^ki{ki — 1). 
Then the clustering coefficient C of the network is defined 
as the average over all nodes 

In Table |, the clustering coefficient of the above model 
is shown for different values of p. 

TABLE I. Clustering coefficient for different values of p 
and a network size of A' = 7000. C' is an upper bound for 
the average clustering coefficient of a network with the same 
degree distribution without transitive linking. Crand is the 
clustering coefficient of a random network of the same size 
and with the same average degree (fc). 



p 


(k) 




C 


C 


Crand 


0.04 
0.01 
0.0025 


14.9 
49.1 
149.2 


912 
13744 
99436 


0.45 
0.52 
0.63 


0.036 

0.29 

0.43 


0.0021 
0.0070 
0.021 
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In comparison to the clustering of a random network 
with same size and same mean degree Grand, the model 
coefficient C is consistently of much larger size. The clus- 
tering coefficient of a random network, i.e. a network 
with constant probability of linking each pair of nodes 
Piink = {k)/iN — l) and, therefore, a Poisson distribution 
of the node degree, is just this probability Crand = PUnk- 
Obviously, Crand IS proportional to the mean degree {k) 
for constant network size. For further comparison, let us 
derive an estimate C for an upper bound of the average 
clustering coefficient of a network with the same degree 
distribution, but randomly assigned links. Thus, C pro- 
vides an upper bound on the average clustering which one 
would expect solely from the degree distribution while ne- 
glecting transitive linking of the model. Using the gener- 
ating function approach for graphs with arbitrary degree 
distributions [pT| and assuming that fluctuations of the 
average degree of the neighborhood of a node can be ne- 
glected, an upper bound C" can be derived in terms of 
the first two moments: 



c = 



1 



{k)N V (fc) 



(2) 



This result holds exactly in the case of the Poissonian de- 
gree distribution of a random network {Crand — C). As 
Table | shows, the network generated by the model ex- 
hibits an even higher average clustering coefficient than 
a network with links distributed randomly according to 
the same degree distribution. In particular, the network 
is much more clustered than a random network with a 
Poisson distribution as required for the small- world prop- 
erty. Furthermore, the clustering is not as much depen- 
dent on the mean degree (fc) as C of a network with the 
same degree distribution but no transitive linking. 

The scaling of the average path length with system size 
is shown in Fig. |^. 
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FIG. 2. The average path length t.{N) as a function of sys- 
tem size TV on a semilogarithmic scale (p — 0.04). The data 
are in good agreement with a logarithmic fit (straight line). 



The data are consistent with a logarithmic behavior 
and, thus, our model meets the second requirement for a 
small-world behavior, as well. Also, this is what one ex- 
pects in the framework of a random graph with arbitrary 
degree distribution Moreover, we can compare the 
average path length of our model with the path length 
of a network with the same degree distribution and ran- 
domly assigned links. The generating function approach 
can again be used to estimate the average path length £ 
of such a network. For a broad distribution this leads to: 



For the Poisson distribution of a random network, 
obtains: 



'-rand 



\ogN 



(3) 



one 



(4) 



Note that only N, (k) and (fc^) are used to estimate £ 
as also used in the derivation of With the help of 
Table | one finds that £' « 1.59 and £rand ~ 1-77 for 
p — 0.0025. Numerical simulations of our model similarly 
yield a very short path length oi£ = 2.38 which is further 
evidence that the simple linking rule of our model leads 
to small-world behavior. Intuitively, t' < £rand follows 
from the highly connected "hubs" present in the scale- 
free networks for p = 0.0025. The fact that £ is slightly 
larger than irand is due to the fact that in step (i) of 
the model, many links are used to build highly clustered 
neighborhoods. This price to pay for clustering, however, 
only slightly affects the small overall mean path length. 

One example for an observed small-world effect is the 
network of coauthorships between physicists in high en- 
ergy physics [0. Nodes are researchers who are con- 
nected if they have co-authored a paper. In a recent 
study of the publications in the SPIRES database over 
the five year period 1995-1999, a graph was reconstructed 
from the data and analyzed jlj,^. The resulting net- 
work consists of 55, 627 nodes with a mean degree (fcs) — 
173, a mean shortest path length £5 — 4.0, and a very 
high clustering coefficient Cs = 0.726. The degree distri- 
bution is consistent with a power-law of exponent —1.2 
|]l7| . From these data one can derive C" and £' for a net- 
work with the same degree distribution but random links 
to C = 0.19 and i' = 1.81. These numbers show that 
the real network exhibits clustering which is very much 
higher than would be expected from the degree distribu- 
tion alone. Also, the path length is short but still larger 
than for a randomly linked network of the same degree 
distribution. Using the logarithmic scaling for £, the data 
of the example agree with the values of the above model. 
The basic assumptions made in the model are met by 
the dataset, as the number of researchers in the sample 
is to a good approximation stationary, and as the small 
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rate of researchers entering or leaving the system during 
the time frame of the sample justifies the regime of small 

The previous example demonstrates, how our model 
can be applied to a social network in the dynamically 
stationary state. Moreover, the model studied here is 
also able to accommodate other small-world scenarios, 
e.g., without a scale-free degree distribution as the 
particular shape of the distribution varies, depending on 
the turnover rate p. The question of the origin of small- 
world behavior in social systems has led to many different 
approaches as well. In an interesting model Mathias and 
Gopal showed that a small world topology can arise 
from the combined optimization of network distance and 
physical distance. Applications of this principle, how- 
ever, are more likely to be found in the field of trans- 
portation networks rather than acquaintance networks 
considered here. A more similar approach to the study 
presented here has been taken by Jin et al. |Q, who 
study a model which shares a mechanism similar to tran- 
sitive linking as defined here. Otherwise, however, it is 
more complicated than we feel it needs to be, at least for 
some classes of social networks. Also its upper limit on 
the degree of a node, motivated by one specific trait of 
some acquaintance networks, otherwise makes the model 
less suitable to meet experimental data of social networks 
which exhibit broad degree distributions. 

In conclusion, a simple dynamical model for the emer- 
gence of network structures in social systems has been 
studied. It is based on a local linking rule which con- 
nects nodes who share a common neighbor, as well as on 
a slow turnover of nodes and links in the system. The 
network approaches a dynamically stationary state, with 
high clustering and small average path lengths which 
scale logarithmically with system size. Depending on a 
single free parameter, the turnover rate of nodes, this 
model interpolates between networks with a scale-free 
and an exponential degree distribution, both of which 
are observed in experimental data of social networks. 
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